Abstract

We study the effect of drift in pure-jump transaction-level models for asset prices
in continuous time, driven by point processes. The drift is assumed to arise from a 
nonzero mean in the efficient shock series. It follows that the drift is proportional to 
the driving point process itself, i.e. the cumulative number of transactions. This link 
reveals a mechanism by which properties of intertrade durations (such as heavy tails 
and long memory) can have a strong impact on properties of average returns, thereby 
potentially making it extremely difficult to determine growth rates. We focus on a 
basic univariate model for log price, coupled with general assumptions on durations 
that are satisfied by several existing flexible models, allowing for both long memory 
and heavy tails in durations. Under our pure-jump model, we obtain the limiting 
distribution for the suitably normalized log price. This limiting distribution need 
not be Gaussian, and may have either finite variance or infinite variance. We show 
that the drift can affect not only the limiting distribution for the normalized log 
price, but also the rate in the corresponding normalization. Therefore, the drift (or 
equivalently, the properties of durations) affects the rate of convergence of estimators 
of the growth rate, and can invalidate standard hypothesis tests for that growth rate. 
Our analysis also sheds some new light on two longstanding debates as to whether 
stock returns have long memory or infinite variance. 
1 Introduction



In recent years, transaction-level data on financial markets has become increasingly avail- 
able, and is now often used to make trading decisions in real time. Such data typically 
consist of the times at which transactions occurred, together with the price at which the 
transaction was executed, and may include other concomitant variables ("marks") such as 
the number of shares traded. Our focus here is on actual transactions rather than quotes, 
but regardless of which type of event is being considered it is important to recognize that 
a useful framework for modeling and analyzing such data is that of marked point processes 
rather than, say, time series in discrete time. Though time series are typically provided for 
further analysis, such as daily (or high frequency) stock returns, these inevitably involve 
aggregation and entail a loss of information that may be crucial for trading and perhaps 
even for risk management and portfolio selection. 

The perspective of asset prices as (marked) point processes has a long history in the 
financial and econometric literature. For example, Scholes and Williams (1977) allowed for 
a compound Poisson process. However, such a model is at odds with the stylized fact that 
time series of financial returns exhibit persistence in volatility. Recent interest in the point 
process approach to modeling transaction-level data was spurred by the seminal paper of 
Engle and Russell (1998), who proposed a model for inter-trade durations. Other work on 
modeling transaction-level data as point processes and/or constructing duration models in- 
cludes that of Bowsher (2007), Bauwens and Veredas (2004), Hautsch (2012), Bacry et al. 
(2011), Deo et al. (2009), Deo et al. (2010), Hurvich and Wang (2010), Aue et al. (2011), 
Shenai (2012), Chen et al. (2012). 

Nevertheless, it must be recognized that time series of asset returns in discrete (say, 
equally-spaced) time are still in widespread use, and indeed may be the only recorded form 
of the data that encompasses many decades. Such long historical series are of importance 
for understanding long-term trends (a prime focus of this paper) and, arguably, for a 
realistic assessment of risk. So given the ubiquitous nature of the time series data but 
also keeping in mind the underlying price-generating process that occurred at the level of 
individual transactions, it is important to make sure that transaction-level models obey 
the stylized facts, not only for the intertrade durations but also for the lower-frequency 
time series. 

It has been observed empirically that time series of financial returns are weakly
autocorrelated (though perhaps not completely uncorrelated), while squared returns or other
proxies for volatility show strong autocorrelations that decay very slowly with increasing 
lag, possibly suggesting long memory (see Andersen et al. (2001)). It is also generally ac- 
cepted that such time series show asymmetries, such as a correlation between the current 
return and the next period's squared return, and this effect (often referred to traditionally 
as the "leverage effect") is addressed for example by the EGARCH model of Nelson (1991). 
The average return often differs significantly from zero based on a traditional t-test,
possibly suggesting a linear trend in the series of log prices. Meanwhile, Deo et al. (2010) found
that intertrade durations have long memory (this was also found by Chen et al. (2012)),
and they investigated the possibility that the durations have heavy tails.

One more fact that we wish to stress is that in continuous time, realizations of series 
of transaction-based asset prices are step functions, since the price is constant unless a 
transaction occurs. Thus, we choose to focus on pure-jump models for the log price (viewed 
as a time series in continuous time), driven by a point process that counts the cumulative 
number of transactions. This is equivalent to a marked point process approach where the 
points correspond to the transaction times and the marks are the transaction-level return 
shocks, which can have both an efficient component and a microstructure component. 

Within this context, we will in this paper investigate the effect of drift (modeled at 
the transaction level) on the behavior of very-long-horizon returns, or equivalently, on the 
asymptotic behavior of the log price as time increases. The drift is assumed to arise from 
a nonzero mean in the efficient shock series. It follows that the drift is proportional to the 
driving point process itself, i.e. the cumulative number of transactions. This link reveals 
a mechanism by which properties of intertrade durations (such as heavy tails and long 
memory) can have a strong impact on properties of average returns, thereby potentially 
making it extremely difficult to determine long-term growth rates or to reliably detect an 
equity premium. 

We focus on a basic univariate model for log price, coupled with general assumptions 
on durations that are satisfied by several existing flexible models, allowing for both long 
memory and heavy tails in durations. Under our pure-jump model (which can capture 
all the stylized facts described above), we obtain the limiting distribution for the suitably 
normalized log price. This limiting distribution need not be Gaussian, and may have 
either finite variance or infinite variance. The diversity of limiting distributions here may 
be considered surprising, since our assumptions imply that the return shocks obey an 
ordinary central limit theorem under aggregation across transactions (i.e. in transaction 
time but generally not in calendar time). We show that the drift can affect not only the 
limiting distribution for the normalized log price, but also the rate in the corresponding 
normalization. Therefore, the drift (or equivalently, the properties of durations) affects the 
rate of convergence of estimators of the growth rate, and can invalidate standard hypothesis 
tests for that growth rate. Our analysis also sheds some new light on two longstanding 
debates as to whether stock returns have long memory or infinite variance. 

The remainder of this paper is organized as follows. In Section 2, we provide a simple 
univariate model for the log price, discuss the trend term and state our first main theorem 
on the limiting behavior of the log price process, as determined by the properties of inter- 
trade durations. In Section 3 we study statistical inference for the trend, and obtain the 
behavior of the ordinary t-statistic under the null hypothesis. We then examine a series 
of examples based on specific duration models that have been proposed in the literature, 
including the ACD model of Engle and Russell (1998) and a generalized form of the LMSD 
model originally proposed in Deo et al. (2010). These examples provide for great diversity 
of the asymptotic distributions of sums of durations and therefore (by our Theorem 2.1 
below) for the asymptotic distribution of the log price. Section 4 provides a concluding
discussion on how our results may help to reconcile some longstanding debates. Proofs of 
the mathematical results are provided in Section 5. 

2 A Simple Univariate Model for Log Price 

We start with a basic univariate pure-jump model for a log price series $y(t)$. Let $N(t)$ be
a point process on the real line. Points of $N$ correspond to transactions on the asset$^1$ and
$N(t)$ is the number of transactions in $(0,t]$. We set $y(0) = 0$ and define, for $t > 0$,

$$y(t) = \sum_{k=1}^{N(t)} \tilde e_k , \tag{2.1}$$

where $\tilde e_k = \mu + e_k$ and the $\{e_k\}$, which are independent of $N(\cdot)$, are i.i.d. with zero mean
and finite variance $\sigma_e^2$.

We assume that $\mu$ is a nonzero constant. The model with $\mu = 0$ was considered in
Deo et al. (2009), who showed that it can produce long memory in the realized volatility.
We therefore have from (2.1),

$$y(t) = \mu N(t) + \sum_{k=1}^{N(t)} e_k . \tag{2.2}$$

Since we are modeling the log price $y(t)$ as a pure-jump process, the log price is constant
when no trading occurs. Unfortunately, the modified version of (2.2) in which $\mu N(t)$ is
replaced by the deterministic time trend $ct$ (where $c$ is a nonzero constant) would not yield
a pure-jump process. Nevertheless, it is quite reasonable from an economic viewpoint to
imagine that $E[y(t)]$ is a linear function of $t$, to account for such phenomena as equity
premia and inflation. This is indeed the case for Model (2.1) if it is assumed that the point
process is stationary with intensity $\lambda > 0$, which implies $E[y(t)] = E[\mu N(t)] = \mu\lambda t$. But
in actual realizations of $y(t)$, the trend is only impounded when a transaction occurs, due
to the nonzero mean in $\{\tilde e_k\}$.
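The step-function nature of the log price and the linear mean trend can be illustrated with a minimal simulation sketch. This is not taken from the paper: it assumes, purely for illustration, that $N$ is a homogeneous Poisson process with intensity `lam` and that the shocks are Gaussian; the function names `simulate_log_price` and `log_price_at` are hypothetical.

```python
import random

def simulate_log_price(horizon, lam, mu, sigma_e, rng):
    """Simulate y(t) = mu*N(t) + sum_{k<=N(t)} e_k on (0, horizon].

    Assumption for illustration only: N is a homogeneous Poisson process
    with intensity lam, and the e_k are i.i.d. N(0, sigma_e^2).
    Returns the transaction times and the log price right after each one.
    """
    times, prices = [], []
    t, y = 0.0, 0.0
    while True:
        t += rng.expovariate(lam)          # next intertrade duration
        if t > horizon:
            break
        y += mu + rng.gauss(0.0, sigma_e)  # jump = drift + efficient shock
        times.append(t)
        prices.append(y)
    return times, prices

def log_price_at(t, times, prices):
    """Value of the step function y at time t (constant between trades)."""
    y = 0.0
    for tk, yk in zip(times, prices):
        if tk > t:
            break
        y = yk
    return y
```

Under these assumptions the simulated $y(t)$ moves only at transaction times, while its mean grows like $\lambda\mu t$.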

Denote the transaction times by $t_k$, with $\cdots < t_{-1} < t_0 \le 0 < t_1 < t_2 < \cdots$, and define
the durations by $\tau_k = t_k - t_{k-1}$. As we will see, the properties of the point process $N$
can play an important role in determining the (asymptotic) properties of statistics of
interest. Two distinct modeling approaches seem natural. One is to model the point
process directly, as in Bacry et al. (2011), who use Hawkes processes. Another approach,
pioneered by Engle and Russell (1998), consists of modeling the durations as a stationary
process. Engle and Russell (1998) defined the ACD model and Deo et al. (2010) defined the LMSD model.
We will consider these duration models as examples in this paper.

1 From a modeling perspective it may be desirable to instead have the points of N correspond to other
relevant trading events, such as "every fourth transaction", "a transaction that moves the price", etc. For
simplicity and definiteness in the paper, we simply let N(t) count actual transactions, but our theoretical
results do not depend on this particular choice of the definition of an event.

Our first theorem demonstrates how the asymptotic distribution of suitably normalized
sums of durations determines that of the correspondingly normalized log price, under
the model (2.2). The theorem is a consequence of the CLT equivalence of Whitt (2002,
Theorem 7.3.1). Proofs of all theorems and lemmas are provided in Section 5. The symbol
$\xrightarrow{d}$ below denotes convergence in distribution, and $\xrightarrow{P}$ denotes convergence in probability.

Theorem 2.1. Assume that for some $\gamma \ge 1/2$,

$$n^{-\gamma} \sum_{k=1}^{n} (\tau_k - 1/\lambda) \xrightarrow{d} A , \tag{2.3}$$

where $A$ is some nonzero random variable. If $\gamma > 1/2$, then $n^{-\gamma}(y(n) - \lambda\mu n) \xrightarrow{d} -\mu\lambda^{1+\gamma}A$.
If $\gamma = 1/2$, then $n^{-1/2}(y(n) - \lambda\mu n) \xrightarrow{d} -\mu\lambda^{3/2}A + \sqrt{\lambda}\,\sigma_e Z$, where $Z$ is a standard Gaussian
random variable, independent of $A$.

We will later make specific assumptions on the durations or on the point process which
imply the assumption of the theorem and can yield a wide variety of limiting distributions
$A$ and rates of convergence $\gamma$. In particular, $A$ may be normal, non-Gaussian with
finite variance, or stable with infinite variance. The case where $\gamma > 1/2$ and the limiting
distribution has finite variance may be indicative of long memory in returns. This possibility
was discussed by Lo (1991). But in our model, if $\mu$ is not very large, any long memory
phenomena generated by the stochastic drift $\mu N(t)$ may be difficult to detect in a data
analysis. We will further develop these remarks later.



3 Statistical Inference for the Trend

For integer $j$ (assuming a time spacing of 1 without loss of generality), we define the
calendar-time returns as $r_j = y(j) - y(j-1)$ and the average return over a time period
of $n$ as $\bar r_n = n^{-1} y(n) = n^{-1}\sum_{j=1}^{n} r_j$. Theorem 2.1 implies that, in general, $\bar r_n - E[\bar r_n]$ will
not be $O_P(n^{-1/2})$, making it difficult to accurately estimate growth rates.

Since our model (under stationarity of $N$) implies that $E[y(t)] = \lambda\mu t$, the growth rate
per unit time is $\mu^* = \lambda\mu$. We therefore consider the problem of statistical inference for $\mu^*$.
We focus on testing a null hypothesis of the form $H_0 : \mu^* = \mu_0^*$ based on $\bar r_n$, which is unbiased
for $\mu^*$. The corresponding t-statistic for testing $H_0$ is

$$t_n = n^{1/2} (\bar r_n - \mu_0^*) / s_n ,$$

where

$$s_n^2 = (n-1)^{-1} \sum_{j=1}^{n} (r_j - \bar r_n)^2 .$$
We next establish that $s_n^2$ consistently estimates a positive constant, under suitable
regularity assumptions.

Lemma 3.1. Under the assumptions of Theorem 2.1, if $N$ is stationary and ergodic and
$E[N^2(1)] < \infty$, then $s_n^2 \xrightarrow{P} \mu^2 \operatorname{var}(N(1)) + \lambda\sigma_e^2$.
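The quantities above can be computed directly from transaction-level data. The following is a minimal sketch, not the authors' code: the helper names are hypothetical, and the simulated trades assume a Poisson process with Gaussian shocks, the simplest case in which $\operatorname{var}(N(1)) = \lambda$ so that the limit in Lemma 3.1 is $\mu^2\lambda + \lambda\sigma_e^2$.

```python
import math
import random

def simulate_poisson_trades(n, lam, mu, sigma_e, rng):
    """Trade times in (0, n] and jumps mu + e_k (illustrative Poisson case)."""
    times, jumps, t = [], [], 0.0
    while True:
        t += rng.expovariate(lam)
        if t > n:
            return times, jumps
        times.append(t)
        jumps.append(mu + rng.gauss(0.0, sigma_e))

def calendar_returns(times, jumps, n):
    """Returns r_j = y(j) - y(j-1), j = 1..n, from trade times and jumps."""
    r = [0.0] * n
    for t, x in zip(times, jumps):
        j = math.ceil(t)          # a trade in (j-1, j] contributes to r_j
        if 1 <= j <= n:
            r[j - 1] += x
    return r

def t_statistic(r, mu_star_0):
    """Ordinary t-statistic n^{1/2} (rbar_n - mu*_0) / s_n, together with s_n^2."""
    n = len(r)
    rbar = sum(r) / n
    s2 = sum((x - rbar) ** 2 for x in r) / (n - 1)
    return math.sqrt(n) * (rbar - mu_star_0) / math.sqrt(s2), s2
```

In the Poisson case, `s2` should be close to $\mu^2\lambda + \lambda\sigma_e^2$ for large `n`; for the duration models discussed below, the same functions apply once the trade times are generated differently.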

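If $\gamma > 1/2$, it follows from Theorem 2.1 and Lemma 3.1 that if the null hypothesis is
true and $\mu \ne 0$ then, with $\sigma^2 = \mu^2 \operatorname{var}(N(1)) + \lambda\sigma_e^2$,

$$n^{1/2-\gamma}\, t_n \xrightarrow{d} -\mu\lambda^{1+\gamma} A / \sigma . \tag{3.1}$$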

Thus, $t_n = O_P(n^{\gamma - 1/2})$ and the t-statistic diverges under the null hypothesis if $\gamma > 1/2$.
Examples where this scenario would occur include durations generated by an ACD model
with infinite variance, or by an LMSD model with long memory and an exponential volatility
function (see Examples 3.1 and 3.2 below). This scenario therefore is consistent with
the empirical properties of durations found in Deo et al. (2010).

If $\gamma = 1/2$, then it follows similarly that

$$t_n \xrightarrow{d} \bigl(-\mu\lambda^{3/2} A + \sqrt{\lambda}\,\sigma_e Z\bigr) / \sigma . \tag{3.2}$$

So in the case $\gamma = 1/2$ the t-test may be asymptotically correctly sized, but this is only
possible if $A$ has a normal distribution, i.e. if the durations satisfy an ordinary central limit
theorem. This would happen, for example, if the durations are i.i.d. with finite variance
(as would be the case for the Poisson process), or if the durations obey an ACD model
with finite variance (see Example 3.1 below), but not if $A$ is non-normal, as can happen
in an example given in Surgailis (2004). Even when $\gamma = 1/2$ and $A$ is normal, $t_n$ would
only be asymptotically standard normal if $\lim_{t\to\infty} \operatorname{var}[(N(t) - \lambda t)/t^{1/2}] = \operatorname{var} N(1)$, which
would hold if $N$ is a Poisson process but would fail if counts are autocorrelated, as would
typically be the case.
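As a worked check of the Poisson case mentioned above, the limit in (3.2) can be evaluated explicitly. This is a sketch relying only on standard facts about i.i.d. exponential durations (mean $1/\lambda$, variance $1/\lambda^2$) and on $\operatorname{var}(N(1)) = \lambda$ for a Poisson process; it is not a derivation taken from the paper.

```latex
% Poisson process with intensity \lambda: the durations are i.i.d. exponential,
% so the ordinary CLT gives (2.3) with \gamma = 1/2 and a Gaussian limit A.
\[
  n^{-1/2}\sum_{k=1}^{n}(\tau_k - 1/\lambda) \xrightarrow{d} A \sim N(0, \lambda^{-2}).
\]
% Variance of the numerator of the limit in (3.2), using independence of A and Z:
\[
  \operatorname{var}\bigl(-\mu\lambda^{3/2}A + \sqrt{\lambda}\,\sigma_e Z\bigr)
  = \mu^{2}\lambda^{3}\cdot\lambda^{-2} + \lambda\sigma_e^{2}
  = \mu^{2}\lambda + \lambda\sigma_e^{2}.
\]
% Since var(N(1)) = \lambda, this equals \sigma^2 = \mu^2 var(N(1)) + \lambda\sigma_e^2, hence
\[
  t_n \xrightarrow{d} N(0,1).
\]
```

The equality of the two variances is exactly what fails when the counts are autocorrelated.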

Remark 3.1. We have assumed explicitly in this paper that the true value of $\mu$ is nonzero.
Thus, we have excluded in the analysis above the asymptotic behavior of the t-statistic
when the null hypothesis $H_0 : \mu^* = 0$ holds. In this case, the first term in (2.2) drops out
and so does the dependence of the properties of the t-statistic on $A$. Indeed, it follows from
the proof of Theorem 2.1 that if $\gamma \ge 1/2$ and $\mu^* = \mu_0^* = 0$ then $t_n \xrightarrow{d} Z$, so the t-test remains
asymptotically correctly sized. Still, the problem of constructing a confidence interval for $\mu^*$
would be very difficult, since once the possibility that $\mu^* \ne 0$ is entertained the distribution
of the re-centered statistic $n^{1-\gamma}(\bar r_n - \mu^*)/s_n$ may depend on the parameter of interest
$\mu^*$, as well as on $A$, which may have any of a wide variety of distributions (unknown a
priori). It is clear, then, that feasible statistical inference on $\mu^*$ based on $\bar r_n$ is difficult
or impossible in the absence of knowledge of the generating mechanism for the durations
$\{\tau_k\}$ or for the point process $N$.

Remark 3.2. An economically motivated null hypothesis for which $\mu_0^*$ is not zero would
arise if one wished to test whether the expected return for a particular stock exceeds
the risk-free rate (assumed fixed and known). One might try to turn the problem into a
hypothesis test for a zero mean by working with the excess returns (difference between the
actual return and the fixed risk-free rate). Unfortunately, the t-test for the null hypothesis
that the expected excess return is zero would in general fail to be asymptotically correctly
sized, since the asymptotic distribution for the corresponding t-statistic remains exactly as
in Equations (3.1) and (3.2), with the same value of $\mu$, i.e. the expectation of the return
shock $\tilde e_k$. In the absence of an equity premium, so that the expected return is equal to
the risk-free rate, $\mu$ in our model would be equal to $\lambda^{-1}$ times the risk-free rate, so we
would have $\mu > 0$ even though the expected excess return is zero. The key point is that
subtracting a linear time trend from both sides of Equation (2.2) does not change $\mu$, and
therefore does not prevent the t-statistic from having nonstandard asymptotic properties.
A similar argument would hold if the risk-free rate is taken to be observable and stochastic,
but then one would also need to make assumptions about the statistical behavior of the
risk-free rate.

We next consider two examples of generating mechanisms for the durations: the ACD 
model and a generalized version of the LMSD model. In both cases, the durations are 
assumed to form a stationary process. Unfortunately, it is known (see, for example, 
Nieuwenhuis (1989)) that except for the Poisson process there is no single probability 
measure under which both the durations and the associated counting process are station- 
ary. We refer to the measure under which durations are stationary as the Palm measure, 
denoted by P°. The Palm theory of point processes (see also Baccelli and Bremaud (2003)) 
guarantees the existence, under suitable regularity conditions, of a corresponding measure, 
denoted by P, under which the point process N is stationary. An economic interpretation 
of the Palm duality was provided by Deo et al. (2009). We will use E to denote expectation 
under the P measure, and E°, cov° to denote expectation and covariance under the Palm 
measure, P°. 

Example 3.1 (ACD durations). Assume that under the Palm measure $P^\circ$ the durations
form a stationary ACD(1,1) process, defined by

$$\tau_k = \psi_k \epsilon_k , \qquad \psi_k = \omega + \alpha\tau_{k-1} + \beta\psi_{k-1} , \qquad k \in \mathbb{Z} , \tag{3.3}$$

where $\omega > 0$ and $\alpha, \beta \ge 0$, and $\{\epsilon_k\}_{k=-\infty}^{\infty}$ is an i.i.d. sequence with $\epsilon_k \ge 0$ and $E^\circ[\epsilon_0] = 1$. If $\alpha + \beta < 1$,
there exists a strictly stationary solution determined by $\tau_k = \omega\epsilon_k \sum_{j=1}^{\infty} \prod_{i=1}^{j-1} (\alpha\epsilon_{k-i} + \beta)$,
with finite mean $E^\circ[\tau_0] = \omega/(1 - \alpha - \beta)$. Moreover, by Carrasco and Chen (2002,
Proposition 17), if $\epsilon_0$ has a positive density on $[0, \infty)$, then the sequence $\{\tau_k\}$ is geometrically
$\beta$-mixing. The tail index $\kappa$ of an ACD(1,1) process is the solution of the equation

$$E^\circ[(\alpha\epsilon_0 + \beta)^{\kappa}] = 1 .$$

Moreover, the stationary distribution satisfies $P^\circ(\tau_1 > x) \sim c x^{-\kappa}$ for some positive constant
$c$. See e.g. Basrak et al. (2002). If $1 < \kappa < 2$, then $n^{-1/\kappa} \sum_{k=1}^{n} (\tau_k - E^\circ[\tau_0])$ converges
to a totally skewed to the right $\kappa$-stable law. Cf. Bartkiewicz et al. (2011, Proposition 5).
A necessary and sufficient condition for $E^\circ[\tau_0^2] < \infty$ is $E^\circ[(\alpha\epsilon_0 + \beta)^2] = \alpha^2 E^\circ[\epsilon_0^2] + 2\alpha\beta + \beta^2 < 1$.
Cf. Giraitis and Surgailis (2002, Example 3.3). Under this condition, it also holds
that $\sum_{k=1}^{\infty} \operatorname{cov}^\circ(\tau_0, \tau_k) < \infty$. Since the ACD process is associated (positively dependent),
the summability of the covariance function implies the central limit theorem for the partial
sums. See Newman and Wright (1981), Giraitis and Surgailis (2002, Theorem 6.2).

The previous convergences hold under P°, hence also under the corresponding measure 
P for which the point process is stationary. See Nieuwenhuis (1989) and Deo et al. (2009). 

In order to check the condition $E[N^2(1)] < \infty$ of Lemma 3.1, we must assume that
$E^\circ[\tau_0^{q+1}] < \infty$ for some $q > 4$. See Aue et al. (2011, Lemma 4.11). A sufficient condition
is $E^\circ[(\alpha\epsilon_0 + \beta)^{q+1}] < \infty$. See Carrasco and Chen (2002). This rules out the convergence to
a stable law.
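The ACD(1,1) recursion (3.3) and the finite-variance condition are easy to explore numerically. The sketch below is illustrative only: it assumes unit-mean exponential innovations (one common choice, not one imposed by the text), and the function names are hypothetical.

```python
import random

def simulate_acd11(n, omega, alpha, beta, rng, burn_in=1000):
    """Simulate tau_k = psi_k * eps_k, psi_k = omega + alpha*tau_{k-1} + beta*psi_{k-1}.

    Assumption for illustration: eps_k are i.i.d. unit-mean exponential.
    """
    if alpha + beta >= 1:
        raise ValueError("need alpha + beta < 1 for a stationary solution")
    psi = omega / (1.0 - alpha - beta)   # start at the stationary mean
    tau = psi
    out = []
    for k in range(n + burn_in):
        psi = omega + alpha * tau + beta * psi
        tau = psi * rng.expovariate(1.0)
        if k >= burn_in:                 # discard the initial transient
            out.append(tau)
    return out

def has_finite_variance(alpha, beta, second_moment_eps):
    """Check alpha^2 E[eps^2] + 2*alpha*beta + beta^2 < 1."""
    return alpha ** 2 * second_moment_eps + 2 * alpha * beta + beta ** 2 < 1
```

For unit-mean exponential innovations $E^\circ[\epsilon_0^2] = 2$, so for example $(\alpha, \beta) = (0.1, 0.7)$ satisfies the finite-variance condition while $(0.6, 0.3)$ does not, even though $\alpha + \beta < 1$ in both cases.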

Example 3.2 (LMSD durations). Assume that under the Palm measure $P^\circ$, the durations
form a stationary LMSD process, defined by $\tau_k = \epsilon_k \sigma(Y_k)$, where $\{\epsilon_k, k \in \mathbb{Z}\}$ is an i.i.d.
sequence of almost surely positive random variables with finite mean, $\{Y_k, k \in \mathbb{Z}\}$ is a
stationary standard Gaussian process, independent of $\{\epsilon_k\}$, and $\sigma$ is a positive function.
Deo et al. (2010) made two assumptions not required here, namely that $E^\circ[\epsilon_k^2] < \infty$ and
that $\sigma(Y_k) = \exp(Y_k)$.

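Let $r$ be the Hermite rank of the function $\sigma - E^\circ[\sigma(Y_0)]$. Assume that the covariance
of the Gaussian process $\{Y_k\}$ is regularly varying at infinity, i.e.

$$\rho_n = \operatorname{cov}^\circ(Y_0, Y_n) = \ell(n)\, n^{2H-2} ,$$

where $H \in (1/2,1)$ and $\ell$ is slowly varying at infinity. Assume first that $E^\circ[\epsilon_k^2] < \infty$. Denote $\lambda^{-1} = E^\circ[\epsilon_0]E^\circ[\sigma(Y_0)]$. The
following dichotomy is well known. See Embrechts and Maejima (2002, Chapter 3).

• If $r(1-H) < 1/2$, then

$$n^{-(1-r(1-H))}\, \ell(n)^{-r/2} \sum_{k=1}^{[n\cdot]} (\tau_k - \lambda^{-1}) \Rightarrow c\, R_{r,H} ,$$

where $c$ is a nonzero constant, $R_{1,H}$ is the standard fractional Brownian motion, and
for $q \ge 2$ such that $q(1-H) < 1/2$, $R_{q,H}$ is the so-called Hermite or Rosenblatt
process of order $q$ and self-similarity index $1 - q(1-H)$.

• If $r(1-H) > 1/2$, then

$$n^{-1/2} \sum_{k=1}^{[n\cdot]} (\tau_k - \lambda^{-1}) \Rightarrow \varsigma B ,$$

with $\varsigma$ a positive constant and $B$ a standard Brownian motion.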
We thus see that (2.3) may hold with $\gamma = 1 - r(1-H) > 1/2$ in the first case and a possibly
non-Gaussian limit, and $\gamma = 1/2$ with a Gaussian limit in the latter case. Consider now
the case where $\epsilon_k$ has infinite variance. Assume then that

$$P^\circ(\epsilon_k > x) = L(x)\, x^{-\alpha} ,$$

with $\alpha \in (1,2)$ and $L$ a slowly varying function. It is then shown in Kulik and Soulier
(2012) that a similar dichotomy exists.

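• If $r(1-H) < 1/\alpha$, then

$$n^{-(1-r(1-H))}\, \ell(n)^{-r/2} \sum_{k=1}^{[n\cdot]} (\tau_k - \lambda^{-1}) \Rightarrow c\, R_{r,H} .$$

• If $r(1-H) > 1/\alpha$, then

$$n^{-1/\alpha} \sum_{k=1}^{[n\cdot]} (\tau_k - \lambda^{-1}) \Rightarrow L_\alpha ,$$

where $L_\alpha$ is a totally skewed to the right $\alpha$-stable Lévy process.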

We thus see that (2.3) may hold with $\gamma = 1 - r(1-H) > 1/2$ in the first case and a
possibly non-Gaussian limit, and $\gamma = 1/\alpha$ with a stable non-Gaussian limit in the latter
case. These convergences hold under $P^\circ$, hence also under $P$.

In order to check the condition $E[N^2(1)] < \infty$ of Lemma 3.1, we must assume that
$E^\circ[\epsilon_0^{q+1}] < \infty$ for some $q > 2/(1-H)$. See Aue et al. (2011, Lemma 4.11). This rules out
the convergence to a stable law.
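For readers who wish to experiment with LMSD durations, the following is a rough simulation sketch, not the authors' method. It makes several labeled assumptions: the exponential volatility function $\sigma(y) = e^{y}$ of Deo et al. (2010), unit-mean exponential $\epsilon_k$, and a long-memory Gaussian sequence approximated by a truncated moving-average representation of fractionally integrated noise with $d = H - 1/2$ (whose covariance decays like $n^{2H-2}$). All function names and the truncation length `trunc` are hypothetical.

```python
import math
import random

def long_memory_gaussian(n, H, rng, trunc=500):
    """Approximate stationary Gaussian sequence with covariance ~ n^(2H-2).

    Illustrative choice: truncated MA(infinity) representation of
    fractionally integrated noise with d = H - 1/2, rescaled to unit variance.
    """
    d = H - 0.5
    psi = [1.0]
    for j in range(1, trunc):
        psi.append(psi[-1] * (j - 1 + d) / j)      # MA coefficients of (I - B)^(-d)
    scale = math.sqrt(sum(c * c for c in psi))     # makes var(Y_k) = 1
    z = [rng.gauss(0.0, 1.0) for _ in range(n + trunc - 1)]
    return [sum(psi[j] * z[k + trunc - 1 - j] for j in range(trunc)) / scale
            for k in range(n)]

def simulate_lmsd(n, H, rng):
    """Durations tau_k = eps_k * exp(Y_k) with unit-mean exponential eps_k."""
    y = long_memory_gaussian(n, H, rng)
    return [rng.expovariate(1.0) * math.exp(yk) for yk in y]
```

Because of the truncation, the simulated sequence has long but ultimately finite memory, so it should be treated only as a finite-sample approximation of the model in Example 3.2.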

4 Discussion: Long Memory and Heavy Tails of Stock 
Returns 

The introduction of a nonzero mean in the efficient shocks in the model (2.1) provides a 
link by which properties of intertrade durations can affect those of certain quantities that 
are observed at a macroscopic level. We have focused so far on inference for the trend 
(based on studying the asymptotic distribution of the log price). To illustrate just one 
of the variety of possible additional quantities of interest, we now turn our attention to 
properties of returns. 

Lo (1991) investigated whether stock returns have long memory, and Mandelbrot (1963) 
argued that returns have infinite variance. Both of these propositions have met with 
considerable controversy, but under the model (2.2) both could contain an important grain 
of truth. Generalizing the analysis presented so far leads to a more nuanced interpretation
of what these propositions could mean. 

From here on in this section, when we mention sequences of random variables, we allow 
for suitable renormalization (centering and scaling) without always specifically mentioning 
or writing the renormalization. So the discussion here is somewhat informal, but can be 
made mathematically rigorous. We focus here on the case $\gamma > 1/2$.

Theorem 2.1 implies that partial sums of returns (after suitable renormalization) con- 
verge in distribution to a random variable that need not be Gaussian. This theorem has 
allowed us to discuss issues related to inference for the slope parameter, which is the ex- 
pectation of the average return. It is also of interest to ask if one can go further and say 
something about the joint distribution of the returns themselves, rather than their sum. 
Under the stronger assumption of functional convergence of partial sums of durations
to a limiting stochastic process (which is the case for all examples considered in this paper),
it is indeed possible to discuss the joint limiting distribution of any fixed number of
contiguous returns at long horizons.

Although we have so far taken the time spacing in defining the returns to be 1, there
is no essential reason for this and here we replace it by an arbitrary $T > 0$, and we define
the returns with respect to this time spacing as $r_{j,T} = y(jT) - y((j-1)T)$. Now consider
the first $M$ of these returns, where $M$ is fixed. It follows from our assumptions here (by
arguments similar to the proof of Theorem 2.1 and by Theorem 7.3.2 of Whitt (2002))
that the joint distribution of these $M$ returns (after suitable renormalization) converges as
$T \to \infty$ to the distribution of $M$ contiguous increments of the limiting process.
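The returns at an arbitrary time spacing can be read off the pure-jump path directly. The sketch below is a hypothetical helper (the name `returns_at_spacing` and its argument layout are assumptions for illustration): it evaluates the step function $y$ at the grid points $jT$ and differences the values.

```python
import bisect

def returns_at_spacing(times, cum_log_price, T, M):
    """First M returns r_{j,T} = y(jT) - y((j-1)T) of a pure-jump log price.

    times: sorted transaction times; cum_log_price[i]: log price just after
    the i-th transaction (y is 0 before the first transaction).
    """
    def y(t):
        i = bisect.bisect_right(times, t)   # number of trades in (0, t]
        return cum_log_price[i - 1] if i > 0 else 0.0
    return [y(j * T) - y((j - 1) * T) for j in range(1, M + 1)]
```

For example, with trades at times 0.5, 1.5, 2.5 and 7.0 and post-trade log prices 1, 3, 2 and 5, the first four returns at spacing $T = 2$ are 3, -1, 0 and 3; an interval with no trade yields a zero return.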

In our LMSD example, assuming finite variance and an exponential volatility function, 
the limiting process is fractional Brownian motion. Thus in this case, the M returns 
converge in distribution to M contiguous observations of a fractional Gaussian noise. In this 
sense, it could be said that the returns (computed at a sufficiently high level of aggregation) 
have long memory. Simulations not shown here of the model (2.2) in this LMSD case show 
that it may be hard to detect this long memory due to the additive noise that arises from 
the second term on the right-hand side of (2.2).

In the ACD example (and certain cases of the LMSD example as well), it turns out that 
the limiting process can be a stable process, for which the increments are independent and 
have infinite-variance stable distributions. So here, our M long-horizon returns converge 
in distribution (as $T \to \infty$, and after suitable renormalization) to a sequence of $M$ i.i.d.
stable random variables. This would seem to correspond to the proposition that returns 
have infinite variance. But actually, the truth here may be more subtle. It can happen 
that, for each fixed T the variance of the returns is finite. See, for example, Daley et al. 
(2000) for the underlying point process theory under heavy-tailed durations. It is even 
possible to construct an example where durations have finite variance and still the limit 
of partial sums of durations is a stable process, so the returns would once again have 
finite variance but converge in distribution to i.i.d. stable random variables with infinite 
variance. Such an example may come from durations that obey a positive version of the 
renewal-reward process discussed in Levy and Taqqu (1991) (see also Hsieh et al. (2007)).
In such a model, durations would have finite variance but their sums would converge to a 
process with infinite variance. 

In the case where the limiting process is a Levy-stable process, it is of interest to note 
that such continuous-time processes have discontinuities with probability 1. These may 
correspond to what practitioners refer to as "jumps" in the log price process, even though 
under our model the log price process is a pure-jump process so that all activity consists 
of jumps. 

The main message of this paper is that even in the simple transaction-level model 
(2.1) there is a wide variety of possible behaviors of macroscopic quantities of interest. 
Some additional quantities we hope to study in future work based on this and similar 
models include: regression coefficients as used in the market model, estimated cointegrating 
parameters (which were considered without a trend term in Aue et al. (2011)), and sample 
autocorrelations. 



5 Proofs of Mathematical Results 

In this section, we prove the results of the previous sections in a more general framework. 
Specifically, we introduce a microstructure noise term which may be dependent on the 
counting process N, thereby allowing for leverage effects. In the mathematical theory 
presented in this paper, all random variables and stochastic processes are defined on a 
single probability space $(\Omega, \mathcal{F}, P)$. Expectation with respect to $P$ will be denoted by $E$, and
var and cov will denote the variance and covariance with respect to $P$. Convergence in
$P$-probability will be denoted by $\xrightarrow{P}$, and convergence in distribution under $P$ of sequences of
random variables will be denoted by $\xrightarrow{d}$. We use $\Rightarrow$ to denote weak convergence under
$P$ in the space $\mathcal{D}([0,\infty))$ of left-limited, right-continuous (càdlàg) functions, endowed with
Skorohod's $J_1$ topology. See Billingsley (1968) or Whitt (2002) for details about weak
convergence in $\mathcal{D}([0,\infty))$. Whenever the limiting process is continuous, this topology can
be replaced by the topology of uniform convergence on compact sets. The model is now
described as follows. 

$$y(t) = \mu N(t) + \sum_{k=1}^{N(t)} \{e_k + \eta_k\} , \tag{5.1}$$

where the sequence $\{\eta_k\}$ satisfies the assumption

$$n^{-1/2} \sum_{k=1}^{[n\cdot]} \eta_k \Rightarrow 0 . \tag{5.2}$$

Theorem 5.1. Under (2.3) and (5.2), if $\gamma > 1/2$, then $n^{-\gamma}(y(n) - \lambda\mu n) \xrightarrow{d} -\mu\lambda^{1+\gamma}A$. If
$\gamma = 1/2$, then $n^{-1/2}(y(n) - \lambda\mu n) \xrightarrow{d} -\mu\lambda^{3/2}A + \sqrt{\lambda}\,\sigma_e Z$, where $Z$ is a standard Gaussian
random variable, independent of $A$.
Theorem 2.1 follows from Theorem 5.1 by taking $\eta_k = 0$.



Proof of Theorem 5.1. Write

$$y(n) - \lambda\mu n = \mu\{N(n) - \lambda n\} + \sum_{k=1}^{N(n)} e_k + \sum_{k=1}^{N(n)} \eta_k .$$

The assumption on the durations implies that $n^{-1}\sum_{k=1}^{n} \tau_k \xrightarrow{P} \lambda^{-1}$, and by the CLT equivalence,
it implies that $n^{-\gamma}\mu(N(n) - n\lambda) \xrightarrow{d} -\mu\lambda^{1+\gamma}A$. Thus it also holds that $N(n)/n \xrightarrow{P} \lambda$.
Denote $x(n) = \sum_{k=1}^{N(n)} e_k + \sum_{k=1}^{N(n)} \eta_k$. It is proved in Aue et al. (2011) that $n^{-1/2}x([n\cdot]) \Rightarrow \sqrt{\lambda}\,\sigma_e B$,
where $B$ is a standard Brownian motion, hence $n^{-1/2}x(n) \xrightarrow{d} N(0, \lambda\sigma_e^2)$. Note
moreover that $n^{-1/2}\sum_{k=1}^{N(n)} \eta_k \xrightarrow{P} 0$, and
since $\{e_k\}$ is independent of $N$, $n^{-1/2}x(n)$ and
$n^{-\gamma}(N(n) - n\lambda)$ converge jointly. □

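Lemma 5.1. Assume that the marked point process with marked points $(t_k, e_k, \eta_k)$ is stationary
and ergodic. Assume moreover that (2.3) and (5.2) hold, and

$$E\Bigl[\Bigl(\sum_{k=1}^{N(1)} (\mu + e_k + \eta_k)\Bigr)^{2}\Bigr] < \infty . \tag{5.3}$$

Then there exists $\sigma > 0$ such that $s_n^2 \xrightarrow{P} \sigma^2$.

Proof of Lemma 5.1. Write

$$s_n^2 = \frac{1}{n-1} \sum_{j=1}^{n} (r_j - \lambda\mu)^2 - \frac{n}{n-1} (\bar r_n - \lambda\mu)^2 .$$

By Theorem 5.1, the second term is $o_P(1)$. Since

$$r_j - \lambda\mu = \sum_{k=N(j-1)+1}^{N(j)} (\mu + e_k + \eta_k) - \mu\lambda ,$$

by ergodicity of the marked point process, we have

$$\frac{1}{n} \sum_{j=1}^{n} (r_j - \lambda\mu)^2 \xrightarrow{P} E\Bigl[\Bigl(\sum_{k=1}^{N(1)} (\mu + e_k + \eta_k) - \lambda\mu\Bigr)^{2}\Bigr] .$$

□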
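Remark 5.1. If $\{t_k, z_k, k \in \mathbb{Z}\}$ are the (marked) points of a stationary (under $P$) marked
point process with finite intensity $\lambda$, then, by Baccelli and Bremaud (2003, Formula 1.2.9),
for all $t > 0$,

$$E\Bigl[\sum_{k=1}^{N(t)} z_k\Bigr] = \lambda t\, E^\circ[z_0] ,$$

where $E^\circ$ is the expectation with respect to the Palm probability $P^\circ$. If the marks $\{z_k\}$
have zero mean under the Palm measure $P^\circ$, then $E\bigl[\sum_{k=1}^{N(t)} z_k\bigr] = 0$, and thus the trend is
$E[y(t)] = \mu E[N(t)] = \lambda\mu t$, even if, under $P$, it might happen that $E[z_0] = \lambda E^\circ[t_1 z_1] \ne 0$.

Remark 5.2. Since the sequence $\{e_k\}$ has zero mean, finite variance and is independent of
the point process $N$, if $E[N^2(1)] < \infty$, then $E\bigl[\bigl\{\sum_{k=1}^{N(1)} (\mu + e_k)\bigr\}^2\bigr] < \infty$. Next, for $r, s$ such
that $1/r + 1/s = 1$,

$$E\Bigl[\Bigl(\sum_{k=1}^{N(1)} \eta_k\Bigr)^{2}\Bigr]
= \sum_{k=1}^{\infty} E\Bigl[\Bigl(\sum_{i=1}^{k} \eta_i\Bigr)^{2} \mathbf{1}_{\{N(1)=k\}}\Bigr]
\le \sum_{k=1}^{\infty} E^{1/s}\Bigl[\Bigl|\sum_{i=1}^{k} \eta_i\Bigr|^{2s}\Bigr]\, P^{1/r}(N(1) = k) .$$

Assume now that there exists some constant $C_s$ such that

$$E\Bigl[\Bigl|\sum_{i=1}^{k} \eta_i\Bigr|^{2s}\Bigr] \le C_s k^{s} , \tag{5.4}$$

then, for $q > 0$, by Hölder's inequality,

$$E\Bigl[\Bigl(\sum_{k=1}^{N(1)} \eta_k\Bigr)^{2}\Bigr]
\le C_s^{1/s} \sum_{k=1}^{\infty} k\, P^{1/r}(N(1) = k)
\le C_s^{1/s} \Bigl(\sum_{k=1}^{\infty} k^{-qs}\Bigr)^{1/s} E^{1/r}\bigl[N^{(q+1)r}(1)\bigr] .$$

If we can choose $q$ such that $qs > 1$ and $E[N^{(q+1)r}(1)] < \infty$, then the right-hand side is
finite and thus (5.3) holds.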

Example 5.1 (Leverage). We now provide a specific example of a microstructure noise series
$\{\eta_k\}$ that is dependent on $N$. Assume that under the Palm measure $P^\circ$, the durations form
an LMSD sequence $\tau_k = \epsilon_k e^{Y_k}$ as in Example 3.2, with memory parameter $d_\tau = H_\tau - 1/2 \in (0, 1/2)$.
Assume moreover that the spectral density $f_Y$ of the Gaussian process $Y$ satisfies
$f_Y(x) = |1 - e^{ix}|^{-2d_\tau} h(x)$, where the function $h$ is slowly varying at 0. Let now $\eta_k$ be
defined as follows:

$$\eta_k = [(I - B)^{\delta} Y]_k ,$$

where $B$ is the backshift operator and $\delta$ is such that $\delta \in (d_\tau, \infty)$. Note that $E^\circ[\eta_k] = 0$.
The spectral density of the weakly stationary sequence $\{\eta_k\}$ is given by

$$f_\eta(x) = |1 - e^{ix}|^{2\delta} f_Y(x) = |1 - e^{ix}|^{2\delta - 2d_\tau} h(x) .$$

Thus the sequence $\{\eta_k\}$ has negative memory $d_\eta = d_\tau - \delta$ and Assumption (5.2) holds.
It remains to check Condition (5.3). Since (5.4) holds for all $s > 1$, $q$ and $r$ can be
chosen arbitrarily close to 0 and 1, respectively, and thus (5.3) holds if $E[N^2(1)] < \infty$. If
$\delta$ is chosen to be in $(d_\tau, d_\tau + 1/2)$, then $d_\eta \in (-1/2, 0)$. And if $\delta = 1$, then $\eta_k = Y_k - Y_{k-1}$.
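The fractional differencing filter used to define the noise in this example can be made concrete through its binomial expansion. The sketch below is illustrative only: the function names are hypothetical, and applying the filter to a finite sample necessarily truncates the infinite expansion at the start of the sample.

```python
def frac_diff_coeffs(delta, n_terms):
    """Coefficients pi_j of (I - B)^delta = sum_j pi_j B^j (binomial expansion)."""
    coeffs = [1.0]
    for j in range(1, n_terms):
        coeffs.append(coeffs[-1] * (j - 1 - delta) / j)
    return coeffs

def apply_filter(coeffs, y):
    """eta_k = sum_j pi_j * y_{k-j}, using only the available past of y."""
    return [sum(c * y[k - j] for j, c in enumerate(coeffs) if k - j >= 0)
            for k in range(len(y))]
```

With `delta = 1` the coefficients reduce to 1, -1, 0, 0, ..., which recovers the first difference of $Y$ mentioned at the end of the example; for non-integer `delta` the coefficients decay slowly and never vanish.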